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BIT ALLOCATION FOR ENCODING 
TRACK INFORMATION 

FIELD 

[0001] Embodiments of the invention relate to encoding track information in codecs. 
Specifically, the disclosure relates to bit allocation for jointly encoding track information from 
multiple subframes. 

BACKGROUND 

[0002] Speech compression in traditional voice communication systems provides more efficient 
use of bandwidth than sending uncompressed signals, because the systems are enabled to transfer 
more data within the same bandwidth allocation. Speech compression is a technique for 
representing an analog speech signal in digital format with as few bits as possible, while 
preserving signal quality. The number of bits used to represent the speech signal directly affects 
the bit rate of the encoder, with higher bit rates requiring more bandwidth. Thus, a lower 
encoding bit rate would generally result in a more efficient use of bandwidth. 
[0003] Furthermore, speech encoders and decoders typically operate under time constraints in 
which the compression/decompression should occur. Thus, the goals in the design of a speech 
codec (coder/decoder) are generally to minimize the bit rate of the encoded speech signal, while 
reducing the complexity of the speech compression algorithms and minimizing delay in 
encoding. Although a design seeks to reduce complexity and minimize delay, another goal is to 
preserve the speech quality of the signal. 

[0004] Speech compression standards are often used as design guides in designing speech 
codecs, because many of the above issues have been contemplated in the standards, and codecs 
implementing the standards may be interoperable with other devices supporting the standards. 
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The speech compression standards may set forth a bit allocation scheme for the encoder, such as 
defining a frame of data with certain bit positions within the frame having a standard meaning. 
Such frames may be subdivided into two or more subframes, and each subframe may include 
several data tracks. The bit allocation scheme encodes the parameters as a sequence of bits and 
hence encodes pulse positions and/or signs as a sequence of bits. The whole of the information 
represented in the various pulses of the various tracks defines an excitation vector used to encode 
the data in a set number of digital bits. A receiving decoder uses the encoded bit stream to 
generate excitation vectors to decode the compressed signal. 

[0005] Some standards may provide for an uneven distribution of pulses among the tracks, such 
that one or more tracks may include an extra pulse(s). Thus, the encoder should include an 
indication of which track(s) include extra pulse(s) in the frame sent to the decoder. This 
indicator is often referred to as indicating the starting track, meaning the first track with extra 
pulses. The track indicator may be a set of bits sent with each subframe indicating which track 
in the subframe is the starting track. 

[0006] One compression standard is the Algebraic-Code-Excited Linear-Prediction (ACELP), 
with a derivative being Conjugate-Structure ACELP (CSACELP). The ITU-T (International 
Telecommunication Union Telecommunication Standardization Sector) has defined the G.729 
digital transmission system standard based on CSACELP. As with other speech coding 
standards, ITU-T G.729 specifies a coding bit rate. G.729 is defined to operate at 8.0 kbit/s for 
compression of normal speech signals, with extension G.729E defined to operate at 1 1.8 kbit/s 
for compression of a wider range of signals, including speech with noise, music, etc. G.729E 
defines 2 subframes, each with 5 tracks, each track containing 8 pulse positions, for a total of 40 
pulse positions in which to define an excitation vector in the frame. The pulse positions are 
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interleaved in such a way that track TO has positions (0, 5, 10, . . .), Tl has positions (1, 6, 1 1, 
...), and so forth. 

[0007] G.729E specifies the use of 12 pulses, meaning there will be an uneven distribution of 
pulses across the five tracks. Accordingly, two of the five tracks have three pulses, and the other 
three have two pulses. One of the two tracks with an extra pulse is identified as a starting track, 
which is indicated to the decoder for proper decoding alignment. A starting track indicator is 
sent at the beginning of each subframe, using 3 bits each to indicate one of the five tracks. But 
note that because three bits are used, there will be three unused combinations. This means that in 
a frame with two subframes, there will be three unused combinations in each of the two starting 
track indicators. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0008] The disclosure includes various illustrations by way of example, and not by way of 
limitation in the figures and accompanying drawings, in which like reference numerals refer to 
similar elements. 

[0009] Figure 1 is one embodiment of a flow diagram of aspects of a bit allocation algorithm. 

[0010] Figure 2 A is a prior art embodiment of a frame of encoded data. 

[0011] Figure 2B is one embodiment of a frame of encoded data. 

[0012] Figure 3 is one embodiment of a joint encoding reference table. 

[0013] Figure 4 is one embodiment of a joint encoding reference table. 

[0014] Figure 5 is one embodiment of a block diagram of an encoding unit. 
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DETAILED DESCRIPTION 

[0015] Methods and apparatuses for bit allocation are described. An algorithm is used to jointly 
encode track information from among multiple subfirames of a frame of data to be encoded. 
Joint encoding may lower the encoding bit rate, and thus preserve bandwidth. 
[0016] Speech compression deals with receiving a data signal in analog form and filtering and 
encoding the signal in a bitstream to send to a receiver/decoder. Each bit in the bitstream 
transmits information to the decoder based on its position in the bitstream. The data signal and 
encoded bitstream are typically dealt with in blocks of data, or frames, that may be divided into 
multiple subframes. Each subframe of data is filtered and encoded to result in a stream of bits 
that is received at a decoder. 

[0017] For example, speech codecs may be designed based on standards that define a frame of 
data as including a number of pulse positions within a specific sequence of bits in the frame. The 
pulses within the sequence are used to derive an excitation vector to use in decoding the 
compressed signal. Thus, the bit allocation defined between a codec pair allows for transmission 
of compressed signals between the encoder and decoder. 

[0018] Figure 1 illustrates one embodiment of a flow diagram of aspects of a bit allocation 
algorithm. The standards may provide for an uneven distribution of pulses among the tracks, 
such that one or more tracks may include an extra pulse(s). Thus, the encoder should include an 
indication of which track(s) include extra pulse(s) in the frame sent to the decoder. This 
indicator is often referred to as indicating the starting track, meaning the first track with extra 
pulse positions. The track indicator may be a set of bits sent with each subframe indicating 
which track in the subframe is the starting track. 
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[0019] As indicated above, one compression standard is the Algebraic-Code-Excited Linear- 
Prediction (ACELP), with a derivative being Conjugate-Structure ACELP (CSACELP). The 
ITU-T (International Telecommunication Union Telecommunication Standardization Sector) has 
defined the G.729 digital transmission system standard based on CSACELP. As with other 
speech coding standards, ITU-T G.729 specifies a coding bit rate. G.729 is defined to operate at 
8.0 kbit/s for compression of normal speech signals, with extension G.729E defined to operate at 
11.8 kbit/s for compression of a wider range of signals, including speech with noise, etc. G.729E 
defines 2 subframes, each with 5 tracks, each track containing 8 pulse positions, for a total of 40 
pulse positions in which to define an excitation vector in the frame. The distribution of pulse 
positions is such that track TO has positions (0, 5, 10, . . .), Tl has positions (1,6, 11, . . .), etc. 
[0020] G.729E specifies the use of 12 pulses, giving an uneven distribution of pulses across the 
five tracks, with two of the five tracks having three pulses, and the other three having two pulses. 
One of the two tracks with an extra pulse is identified as a starting track, which is indicated to the 
decoder for proper decoding alignment. A starting track indicator is sent at the beginning of 
each sub frame, using 3 bits each to indicate one of the five tracks. But note that because three 
bits are used, there will be three unused combinations. This means that in a frame with two 
subframes, there will be three unused combinations in each of the two starting track indicators. 
The end result will be that the equivalent of an extra bit of empty information is sent with every 
frame. This may be avoided by jointly encoding the starting track information of the separate 
subframes into a single track indicator that indicates the starting track for multiple subframes. 
[0021] For example, an encoder will identify which of the multiple tracks in each subframe is to 
be the starting track for each respective subframe. This information is transmitted as a joint track 
indicator to the decoder to ensure proper alignment at the decoder. As illustrated in Figure 1, a 
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starting track is identified for both the first and the second subframes, 102 and 104. Although 
two subframes are illustrated in Figure 1, the skilled artisan will appreciate that a frame 
subdivided into more subframes can be used. Thus, additional steps in Figure 1 might be to find 
a starting track for any additional subframes that might be present in a frame of data transmitted 
between an encoder and decoder. 

[0022] Starting track information is jointly encoded, 106. Where a starting track was identified 
for each subframe of the frame of data, a single indicator is generated to represent the starting 
track for all subframes. Thus, in contrast to sending a starting track indicator with each 
subframe, a single starting track indicator that indicates the starting track information for all 
subframes is sent with the frame. One way to format such a joint indicator is to pre-define a set 
of values of a joint track indicator that will correspond to a set of ordered pairs. For example, 
instead of sending a starting track indicator at the beginning of a first subframe to indicate track 
TO, and a second starting track indicator at the beginning of a second subframe indicating track 
T2, a joint track indicator could be sent to the decoder that the decoder will interpret as 
indicating an ordered pair (TO, T2). For example, the decoder may include a look-up table 
(LUT) that will use the joint indicator as a search key. In another embodiment, the information 
could be hard-coded into the processing software on the decoder, so that receiving a joint track 
indicator of a certain value will cause a particular code branch or process to execute. The 
decoder may then proceed to decode using this information to align the first subframe on starting 
track TO, and the second subframe on starting track T2. 

[0023] For further illustration, consider G.729E that defines five tracks per each of two 
subframes. Because there are five tracks, three bits are required to send a starting track indicator 
(2 2 <5<2 3 , so 3 bits needed). In the previous example, the starting track indicator for the first 
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subframe could be represented by a three-bit value, e.g., [000], and the starting track indicator of 
the second subframe by, e.g., [010]. To send the starting track information for both subframes 
would then require six bits. However, notice that if there are five tracks in both the first 
subframe and second subframes, there will only be 5x5=25 combinations of ordered pairs needed 
to represent each possible combination of starting tracks for the two subframes. This would only 
require five bits (2 4 <25<2 5 ) instead of the six bits used to send the information in separate 
subframes. Thus, a five-bit joint starting track indicator of value, e.g. [00010] could be used to 
correspond to the starting track pair (TO, T2). The use of a five-bit joint track indicator can save 
1 bit per frame over using two separate three-bit indicators, and still transmit the same 
information. This provides for a compression/decompression algorithm with equivalent quality 
as the standard of G.729E, with no greater complexity, but with a savings of one bit per frame. 
A possible predefined table of values of track indicator to ordered pairs for two subframes is 
shown in Figure 3. Figure 4 illustrates a possible predefined table of values for a track indicator 
that corresponds to ordered sets for three subframes, in the same manner as discussed above. 
[0024] Figure 2 A is a prior art embodiment of a frame of encoded data. Frame 200 includes 
subframes 210 and 220. Each subframe 210 and 220 has a body 212 and 222, respectively. In a 
signal coding system, the body may have separate tracks of pulses encoded by an encoder from 
which a decoder will construct an excitation vector. Note that in Figure 2A, there is a track 
indicator 0 211 for subframe 210 to encode the information necessary to indicate to a decoder an 
identified track. The identified track may be, for example, a starting track of multiple tracks in 
body 212. The decoder may use the identified track, for example, for purposes of alignment. 
There is a separate track indicator 1 221 sent for subframe 220. It will likewise indicate an 
identified track, such as a starting track, of multiple tracks in body 222. 
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[0025] Figure 2B is one embodiment of a frame of encoded data. Frame 250 includes 
subframes 260 and 270, which include bodies 262 and 272, respectively. Subframes 260 and 
270 may include multiple tracks in each subframe, one of which may be indicated for each 
subframe. Rather than sending a separate track indicator for the separate subframes, frame 250 
includes joint track indicator 25 1 . Joint track indicator may be viewed as logically separate from 
the subframes of frame 250, or as a logical part of one of the subframes of frame 250. If viewed 
as a logical part of one sub frame, frame 250 may simple be defined to have a sub frame that is 
encoded with a greater number of bits than the other sub frames in the frame, and this sub frame 
having the information of joint track indicator 25 1 . 

[0026] With a single track indicator, the identified track for all subframes in a frame (in this 
case, identified tracks for both subframes 260 and 270) are indicated with a single indicator. The 
single indicator may provide for better use of bandwidth. Bandwidth can be preserved by using 
the principle that for certain combinations of numbers of tracks and subframes, a single track 
indicator may indicate one of the combinations with fewer bits than would be required to 
indicate a certain track for every subframe in a frame with separate track indicators. 
[0027] Figure 3 is one embodiment of a joint encoding reference table. Table 300 includes rows 
31 1-320 and columns 301-303. Table 300 could be included as a Look-Up Table (LUT) or other 
software structure in the software for a codec. Table 300 would thus be stored on a codec for use 
in the codec during encoding/decoding operations. Table 300 may also be generated at start-up 
and placed in volatile storage. In one embodiment table 300 does not exist as an accessible code 
structure in memory, but rather the information shown as table 300 is hardcoded directly into the 
codec processing software. In this case the software, when executed, may directly use the joint 
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track indicator to determine where to branch (as in a series of conditional branches), or a 
particular process to operate, or provide input to a particular process that will be executed, etc. 
[0028] In one embodiment table 300 has a joint starting track indicator, row 301. The joint 
starting track indicator 301 illustrates a bit pattern, or binary value, that corresponds to a starting 
track indicator that an encoder will send to a decoder. Thus, on the decoder side, the decoder 
will interpret joint starting track indicator 301 as indicating that the starting track of subframe 0 
is the starting track indicated in subframe 0 starting track, column 302. A corresponding entry in 
subframe 1 starting track, column 303, indicates the starting track of subframe 1. Note that in 
this fashion an entry in joint starting track indicator 301 corresponds to an ordered pair 
consisting of (subframe 0 starting track, subframe 1 starting track). 
[0029] The rows 31 1-320 indicate what ordered pairs may correspond to what joint track 
indicator bit pattern. Thus, for example, at row 31 1, joint track indicator of value "00000" 
corresponds to ordered pair (TO, TO), at row 315 joint track indicator of value "01 101" 
corresponds to ordered pair (T2, T3), and so forth. 

[0030] Note that while columns 301-303 make reference to a "starting track," the same 
procedure could be used for any track of interest. The expression "starting track" is thus used 
simply by way of illustration and not limitation. Also note that the entries in table 300 are set 
forth in logical numerical order, with the lowest numbered ordered pairs corresponding to the 
lowest numbered joint starting track indicator. However, other associations of joint starting track 
indicator and ordered pairs may be made, depending on the system implementation and 
software/firmware on which the teachings herein may be implemented. 

[0031] Figure 4 is one embodiment of a joint encoding reference table. As with Figure 3, table 
400 may be stored in a codec system and/or coded directly into the software/firmware 
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implementing the codec functions. Table 400 illustrates that an encoder will associate a 
particular bit-set, represented in joint starting track indicator, 401, as a short-hand representation 
for an ordered set, as in columns 402-404, and a receiving decoder will understand that the 
particular ordered set of columns 402-404 is represented by the particular bit-set. Thus, Figure 4 
points out one possible allocation of bit patterns to indicate starting tracks in a system that has 
three subframes. 

[0032] Note that other configurations may be used beside those illustrated in Figures 3 and 4. 
There may also be many configurations that would not necessarily save bandwidth by using 
fewer bits to encode the same information, for example, a frame of two subframes with three 
tracks per subframe (separate starting track indicators for the two subframes would require two 
bits each (2 1 <3<2 2 ), and a joint track indicator would require four bits (2 3 <9<2 4 )). Such a 
configuration could be used for purposes other than bit-saving, such as convenience, or use in 
conjunction with a system that uses the procedure to save bits. 

[0033] Figure 5 is one embodiment of a block diagram of an encoding unit. System 500 
represents an overview of major system elements of an encoding unit. System 500 may include 
many sub-elements in each of the elements shown in Figure 5. In one embodiment system 500 is 
a system that complies with ITU-T G.729E. 

[0034] Input signal 501 is the target signal to transmit. Input signal 501 is received at pre- 
processing 510 that may perform, e.g., signal scaling and/or high-pass filtering. Such pre- 
processing may perform functions such as reducing the possibility of overflow, or removing 
undesired frequency components. 

[0035] A pre-processed signal is then filtered by filtering 520. Filtering 520 may perform 
functions such as, but not limited by: linear prediction, autocorrelation, pitch analyses, impulse 
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response computations, adaptive codebook search, fixed codebook search, etc. In one 
embodiment filtering 520 includes codebook search engine 521 to perform searches of the 
codebooks to determine how to encode the signal, and what information will be needed to 
indicate to a corresponding decoder how to extract the signal. The resulting signal will include 
the pulse positions for encoding and decoding the target signal. 

[0036] In one embodiment system 500 includes gain quantization 530 to vector quantize 
codebook gains. For example, a fixed-codebook and/or a pitch gain may be vector quantized 
using 7 bits. A gain codebook search is done by minimizing the mean-squared weighted error 
between original and reconstructed speech. 

[0037] Parameter encoding 540 includes the process of creating the encoded bitstream that will 
become transmitted bitstream 55 1 sent to the decoder. The format of the bitstream will be 
dependent on the configuration of system 500. For example, in one embodiment transmitted 
bitstream 551 will be created that includes two subframes in a frame, with the first bits of the 
frame to include a joint starting track indicator for both subframes. Transmitted bitstream 551 
may be transmitted over some sort of transmission medium known in the art, for example, but 
not limited to, a wireline transmission line, a wireless communication transmission module, etc. 
[0038] Reference herein to "one embodiment 1 ' or "an embodiment" means that a particular 
feature, structure or characteristic described in connection with the embodiment is included in at 
least one embodiment of the present invention. Thus, the appearance of phrases such as "in one 
embodiment," or "in another embodiment" describe various embodiments of the invention, and 
are not necessarily all referring to the same embodiment. Besides the embodiments described 
herein, it will be appreciated that various modifications may be made to embodiments of the 
invention without departing from their scope. Therefore, the illustrations and examples herein 
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should be construed in an illustrative, and not a restrictive sense. The scope of the invention 
should be measured solely by reference to the claims that follow. 
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